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Summary 


Maximum  likelihood  estimation  of  the  parameters  X  and  y 
of  a  simple  (linear)  birth-and-death  process  observed  continuously 
over  a  fixed  time  interval  is  studied.  Asymptotic  distributions 
for  large  initial  populations  and  for  large  periods  of  observation 
are  derived  and  some  nonstandard  results  appear.  The  related  problem 
of  estimation  from  the  discrete  skeleton  of  the  process  is  also 
discussed. 
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1.  Introduction 

Let  be  the  population  size  at  time  t  of  the  (linear) 

birth- and- death  process,  that  is,  the  Markov  process  in  which 


p<xt+h 
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iAh  +  o(h) , 

1  -  i(A+y)h  +  o(h) 


iyh'  +  o(h ) , 
, o(h) , 


j=i+l , 

J'=i, 

otherwise , 


i=0,l,2,...,  A  0,  y  >  0,  and  assume  throughout  that  X^  is 
degenerate  at  some  x^  >  0.  We  shall  consider  maximum  likelihood 
estimation  of  the  parameters  A  and  y  assuming  that  the  process 
has  been  observed  continuously  over  some  time  interval . 

The  maximum  likelihood  estimators  are  the  occurrence-exposure 

A  A 

rates  A  =  B^/S^,  y  =  D^/S  ,  and  being  the  number  of 

births  and  deaths  and  S,  =  /^X  du  the  total  time  lived  by  the 

t  0  u 

population  in  the  time  interval  [0,t],  The  sampling  properties 
of  these  estimators  have  been  studied  by  a  number  of  authors,  to 
be  referred  to  in  Section  2,  who  assumed  various  stopping  rules 
depending  on  the  number  of  births  and  deaths. 

In  this  paper  the  sampling  properties  of  the  estimators  will 
be  studied  under  the  assumption  of  a  fixed  interval  [0,t]  of 
observation.  Exact  results  are  as  yet  scarce;  see  Section  2. 
Asymptotic  results  for  large  initial  population  sizes  are  standard 
and  stated  in  Section  3.  Asymptotic  results  for  large  t  are 
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studied  in  Sections  4  and  5»  Two  aspects  of  the  birth-and-death  process 
cause  novel  features,  First,  if  A  >  y ,  X  •>  0  ("becomes  extinct") 

xo 

with  probability  (y/X)  and  X^  -*■  00  otherwise,  whereas  if  X  <_y, 

X  ■  ■+  0  a, So  Accordingly,  Section  k  gives  asymptotic  results  conditioned  on 
X  00  and  Section  5  results  conditioned  on  extinction.  Secondly,  given 
X,  ->  oo,  X./E(X,  )  ->  W  a, s.  where  W  is  nondegenerate.  Asymptotic 
normal  theory  no  longer  holds  but  is  replaced  by  "Student "-distribution 
results.  The  proof  uses  a  generalization  of  Lamperti’s  random  time 
change  transforming  the  birth-and-death  process  into  a  compound  Poisson 
process,  and  then  transforms  the  time  scale  back  in  a  Billingsley  (1968)- 
type  approach. 

Finally,  estimation  under  the  assumption  that  the  process  is  only- 
observed  at  equidistant  points  of  time  (the  so-called  discrete  skeleton) 
is  discussed  in  Section  6  and  the  results  are  shown  to  improve  the  early 
work  of  Inline  1  (l95l)  and  Darwin  (1956),  Furthermore,  it  is  pointed  out 
that  by  making  the  discrete  skeleton  infinitesimal,  the  results  for 
continuous  observation  are  recovered. 

The  results  are  related  to  recent  work  by  Dion  (1972)  on  estimation 
in  the  Galton-Watson  process  and  results  by  Jagers  t 1973c)  on  estimation 
of' the  offspring  distribution  of  a  Bellman-Harris  process. 

The  particular  cases  y  =  0  (the  pure  birth  process)  and  X  =  0 
(the  pure  death  process)  have  been  studied  previously,  Keiding  (197^) 
gave  results  for  the  pure  birth  process,  using  different  proofs,  and 
Beyer,  Keiding  and  Simonsen  (forthcoming)  give  exact  and  L^-convergence 
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results  for  the  pure  birth  process  and.  the  pure  death  process  as  well  as 
a  numerical  evaluation  of  the  asymptotic  results  (for-  these  particular 
processes),  given  in  Sections  3  through  5  of  the  present  paper. 

The  literature  on  estimation  in  the  pure  death  process  is  vast, 
this  problem  occurring  in  a  variety  ..of  life-testing  situations.  We 
shall  not  attempt  to  review  this  literature  but  call  attention  to  the 
review  by  Cox  (1965)  and  a  paper  by  Wolff  (1965)  on  estimation  in 
birth-and- death  processes  of  queueing  theory.  Further  specific  references 
are  given  in  Sections  2  and  3. 

The  asymptotic  results  in  Sections  3  through  6  specialize  in  an 
obvious  way  to  the  pure  birth  and  pure  death  processes.  We  shall  not 
state  this  specialization  explicitly  in  each  case  and  will  therefore 
assume  through  those  Sections  that  X  >  0  and  y  >  0. 
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2 .  Maximum  likelihood  estimation  from  continuous  observation 


Theorem  2.1  The  likelihood  function  is  proportional  to 

\  D  -(A+y)S, 

L(A»y)  =  A  y  e  > 

where  B  and  D^  are  the  number  of  births  and  deaths ,iregpectivfeiy , 
and-  St  =  X^du  is  the  total  time  lived  in  the  population  during 
[0,t].  is  minimal  sufficient  and  the  maximum  likelihood 

A  /\ 

estimators  are  given  by  A  =  B^/S^  and  y  =  D^/S^  . 

Remark .  =  B^  +  D^_  is  to  be  understood  ■  as  the  number  of  discontin¬ 
uities  of  X  ,  0  <  u  <  t.  At  B,  of  these.  X  jumps  +1,  at  D, 

of  them,  X  jumps  -1.  Thus  B  and  D  depend  on  {X  |0  _<  u  t} 

only  and  it  is  seen  that  X^_  -  xQ  =  B^  -  D^ . 

Proof.  The  likelihood  function  seems  to  have  been  derived  first  by 
Darwin  (1956).  The  other  results  are  immediately  derived  from  the 
likelihood  function. 

Remark .  The  characteristic  function  and  some  other  results  concerning 
the  distribution  of  the  minimal  sufficient  statistic  were  given  by 
Puri  (1968),  For  discussions  of  the  distribution  of  jj  when  A  =  0 
see  Hoem  (1969b)  and  his  references. 

/v 

Exact  and  approximate  small-sample  properties  of  A  in  the  pure 
birth  process  (y  =  0)  and  of  y  in  the  pure  death  process  (A  =  0) 
are  given  by  Beyer,  Keiding  and  Simonsen  (forthcoming). 

Alternative  Stopping  Rules .  Stopping  rules  making  the  observation  time 
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t  a  random  variable  have  been  studied  by  a  number  of  authors  .  Thus 
Moran  (1951,  1953)  observed  conditional  on  and  on  the  particular 

sequence  of  births  and  deaths  (thereby  avoiding  untimely  extinction). 

Kendall  (1952)  observed  conditional  on  =  x^  and  Bartlett  (1955, 

Sec.  8.3)  described  briefly  observation  conditional  on  N  or  D . 

"C  "C 

In  all  such  cases,  the  likelihood  function ,;  and  hence  the  maximum 

likelihood  estimators,  properly  interpreted,  remain  the  same  (which, 

it  seems,  was  not  always  realized  by  these  authors)  but  the  sampling 

properties,  of  course,  differ.  Anscombe  (1953)  studied  sequential 

estimation  with  the  criterion  that  X  -  y  be  estimated  with  a  prescribed 

small  standard  error  a.  He  obtained  the  stopping  rule:  observe  until 

S^_  >_  /a.  (Some  device  must  be  prescribed  to  avoid  extinction  before 

then,)  Asymptotically  (as  N  -*■  »)  unbiased  estimates  of  X  +  y  and 

1 /P  —1 /P 

X  -  y  were  obtained  as  aN  '  and  a(B  -D, )W  7  ,  respectively.  We 

*G  t  t  U 

remark  that  since  under  this  stopping  rule  we  may  substitute  for 

l/2 

/a,  these  estimators  are  nothing  but  N^/S^  and  (B^-D^.  )/S^  ,  or 

the  maximum  likelihood  estimators  once  again. 

The  point  is  perhaps  best  illustrated  by  the  reparametrization 

( 4> , © )  =  (X  +  y,y/X).  Then  the  likelihood  function  will  read 
Nt  ~<f>S  D  N 

(J)  e  0  (l  +  0)  so  that  <J)  and  8  are  L-in  depen  dent ,  and 
furthermore,  given  N  ,  is  S-sufficient  for  <j>  and  S-sufficient 

for  0  (in  Barn dor ff-Ni els en ' s  (1971)  terminology).  The  probabilistic 
interpretation  of  this  reparametrization  is  that  <j)  governs  the  "split 
time"  process  (Athreya  and  Ney  (1972))  and  0  the  imbedded  random  walk,  cf. 
Moran  (1951,  1953). 
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3.  Asymptotic  results  .for  large  populations . 


The  birth-and-death  process  -with  =  x^  can  be  interpreted  as 
the  sum  of  independent  birth-and-death  processes  with  the  same 

parameters  and  x^.  =  1  „  The  following  asymptotic  results  for  x^ 
and  fixed  t  may  therefore  be  obtained  from  standard  asymptotic 
maximum  likelihood  theory. 

Theorem  3.1  As  x^->  oo9  (x,y)  (A,y)  a.s.  and 


the  factor  in  {  }  being  replaced  by  x^t  when.  A  =  y. 
Proof.  It  was  shown  by  Puri  (1968)  that 


1  B  \ 

I  x\ 

t 

1 

K 

= 

y 

\S,  j 

i1! 

\ 1 1 

(A  -  y)t 

,f. .  _ _  i  ~  .  x 

A  -  y  0 


(appropriately  modified  when  A  =  y) ,  and  since  by  the  strong  law  of 

large  numbers  B^/x^  "*■  D^/xq  ^t  311(1  ^t^X0  ®®t  as  xo  00 » 

the  strong  consistency  follows.  Asymptotic  normality  follows  from 
standard  theory,  we  need  only  compute  the  information  matrix.  But 


- D^logL 

-DxVogL 

-2 

V  0 

1 

2 

„  -2 

\-DpDAlogL 

-DplogL  j 

°  V 

i 

7 


and  the  result  follows  by  taking  the  expectations. 

Remark.  Sverdrup  (1965)  gave  a  careful  study  of  similar  properties 
for  related  processes,  cf.  also  Kendall  (19^9)  and  Hoem  (1969a,  19Tl) . 
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Up  Asymptotic  results . for  large  periods  of  observation,  given  non-extinction 


In  the  supercritical  case,  that  is,  when  X  >  y,  it  is  well  known  that 

X  /E(X+)  =  Xt/{xQexp(X  -  y)t}  ->  ¥  a.s.  as  t  ->  «=,  where  P{¥  =  0}  = 

,x0 

P{X^_  ->  0}  =  (y/X)  and  the  distribution  of  W,  given.  W  >  0,  is  gamma 

-1  xo  xcf1  "xow 

(x0,x0  ),  that  is,  has  the  density  xQ  w  e  /T(x  ),  w  >  0  (Harris 
(1963) )«  Similar  results  hold  for  a.s.  convergence  of  the  minimal 
sufficient  statistic  which  implies  the  following  consistency  result. 

Theorem  4 .1 


(a)  As  t  -> 


a.  s  . 


(b)  As_  t  -*■  03 ,  (X,y)  ->  (X,y)  a.s.  on  the  set  {X^  -*■  <»} . 

Proof,  (b)  is  a  corollary  of  (a).  To  prove  (a),  we  may  use  lagers' 
(1973a)  results  on  almost  sure  convergence  of  random  functionals  of 
general  branching  processes.  In  fact,  the  birth-and-death  process  is 
a  general  branching  process  with  Malthusian  parameter  X  -  y,  life- 
length  distribution  function  L(x)  =  1  -  e~^X  and  expected  reproduction 
process  given  by  the  density  Xe  ^xdx.  In  the  case  x^  =1,  (a)  is  now 
obtained  directly  from  lagers'  Corollaries  1,2,  and  5,  and  the  general¬ 
ization  to  Xq  >  1  is  immediate. 

The  asymptotic  behavior  of  the  estimators  on  the  set 
{W  =  0}  =  {X  ->  0}  is  described  in  Section  5  below. 


Theorem  4.2  As  t 


CO  ^ 


( 

in  the  conditional  distribution,  given  W  >  0,  where  A,  B  and  ¥ 
are  independent ,  A  and  B  are  normal  (0,1)  and  W  is  gamma 
(x0,x0  ). 

Proof.  The  almost  sure  convergence  of  S  ,  properly  normalized,  was 

111  U 

shown  in  Theorem  4.1. 

The  asymptotic  normal  distribution  is  obtained  from  asymptotic 

normality  in  a  certain  compound  Poisson  process  which  is  converted  into 

a  birth- and- death  process  by  a  random  time  change,  and  then  applying 

Billingsley  (l968)-type  results  to  verify  that  the  asymptotic  normality 

holds  after  the  random  time  change. 

Let  a  compound  two-dimensional  Poisson  process  be 

defined  in  the  following  way.  At  each  event  of  a  Poisson  process 

with  intensity  A  +  y  the  two-dimensional  cluster  size  (M,N)  =  (l,0) 

or  (0,l)  with  probability  A/(A  +  y)  and  y/(A  +  y),  respectively. 

Then  if  (M.  i=l,2,...  are  independent  replications  of  (M,N), 

Ut 

(Qt,Rt)  =  (Ip  M± ,  N.).  Obviously  E(M,N)  =  (A/(A+y) ,y/( A+y))  so 

that  E(Q^,R^)  =  (At,yt)  and  similarly  it  is  seen  that 


S J/2(a  -  x)x-1/2 


1)1 


-1/2 


-  y;y 

(A  -  y)S+e_(A  "  y)t/xn/ 


w 
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j  V(Bt)  =  (A  +  y)tE(M2)  =  At,  V(Dt)  =  yt  and  Cov(B  ,Dt)  = 

(A  +  y)tE(MN)  =  0.  Since  (Q^R^)  has  independent  increments, 

it  follows  by  the  central  limit  theorem  that  ( (Q  -  At ) ( At)-"^2 , 

“l/2 

(R^  -  yt)(yt)  )  is  asymptotically  two-dimensional  normal  with 
mean  zero  and  variance  maUrix  the  identity  . 

/tATo 

Let  -  R^  +  Xq  and  define  by  L t  =  J0  KL  in, 

u 

where  Tn  =  inf{tiK  =  0}.  Then  KT  is  a  birth-and- death  process 
°  Lt  Lt 

with  parameters  A  and-  ys  which  may  be  seen  in  a  similar  way  as  for 
|  the  Lamperti  representation  of  the  pure  birth  process  (Athreya  and 

I 

Ney  (1972),  Theorem  111,11.1).  Furthermore  QT  and  RT  correspond 

L .  L . 

i  t  t 

to  B,  and  D.  as  defined  above,  and  L,  to  S .  , 

!  t  t  ’  t  t 

Replace  now  t  by  L  in  the  asymptotic  'result  above  to  get 

f  u 

1  ((Ql  -  ALt)  (ALt)"l/2,(RL  -  yLt)(yLtrx/2)  which  in  light  of  the 

t  t 

I  interpretation  as  birth-and-death  process  has  the  same  distribution  as 

(S^2(A  -  A)A-1^2,  S^2(y  -  y)y”1/2).  The  proof  that  the  asymptotic 
j  normality  will  still  hold  after  the  random  time  change  t  -*■  L  on  the 

l  "C 

set  (L  -*■  °°}  is  now  similar  to  Dion’s  (1972)  proof  for  discrete  time, 
cf.  also  Jagers  (1973b).  From  this  proof  we  also  conclude  that  the 
asymptotic  normal  distribution  is  independent  of  W  as  stated. 

Remark .  This  method  of  proof  will  yield  a  series  of  central-limit 
type  theorems  for  Markov  branching  processes „  Such  theorems  will  be 
useful  counterparts  to  central  limit  results  e_,g_,  of  the  type  stated 
by  Athreya  and  Ney  ( 1972 3  Sec.  III. 10). 
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Corollary,  As  t  ->  <», 


[xn(ea"y)t-l)/(A-y)]1/2 


(X-A)A“1/2 


(y-y)y 


-1/2 


w 


AW 


i  BW 


■1/2' 

-1/2 


in  the  conditional  distribution,  given  ->.  <».  The  limiting  distribution 
is  bivariate  Student  with  common  denominator  (cf.  Johnson  and  Kotz,  1972, 
p.  134),  that  is ,  each  component  is  Student  with  2x^  d.f.  and  the  com¬ 
ponents  are  independent  for  given  W. 

Remark .  As  x^  ->  oo  in  the  Corollary  above ,  the  limiting  distribution 
tends  towards  the  two-dimensional  standardized  normal  in  accordance  with 
the  result  in  Theorem  3.1. 

Remark.  The  results  of  Theorem  4.2  and  its  Corollary  will  still  hold  if 
considered  in  the  conditional  distribution  given  {X^  >0}  instead  of 
{Xt  ->■  oo} .  By  this  remark,  which  is  parallel  to  one  made  by  Dion  (1972), 
approximate  confidence  limits  may  be  obtained. 
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5 „  Asymptotic  results  for  large  periods  of  observation  given 
ultimate  extinction 

If  X,  ■+  0  as  t  »  which  happens  with  probability  (y/l) 

"C 

in  the  supercritical  case  A  >  y  and  almost  surely  otherwise,  the 
consistency  of  the  estimators  no  longer  holds,  since  the  sample  will 
be  in  effect  finite  as  t  «. 

Since  for  a  supercritical  process  with  A  >  y9  the  conditional 
distribution,  given  that  X  0 ,  is  identical  to  that  of  a  birth-and- 
death  process  with  birth  parameter  y  and  death  parameter  X  (Waugh 
1958),  the  results  in  the  present  section  are  relevant  for  supercritical 
processes,  given  extinction . 

A  A 

Theorem  5„1  For  X  ^  yi3  (  A,  y)  (B/S,D/S)  a,s.,  as_  t  ->  <»,  where 
B  and  D  are  the  total  number  of  births  and  deaths  until  extinction 

foo 

X,  dt  o 
"O  t 

The  distribution  of  (B,D,S)  is  given  by  the  density 


'0 


bid* 


b  0,1,2,.  .  . ,  d=  ^+Xq ,  s  .jl 0 » 

Proof.  Most  of  the  results  are  immediate.  The  distributions  of  N 
and  of  S  given  N  were  given  by  Puri  ,  (1963)  „  Gani  and  McNeil  (1971 ) 
discussed  further  aspects  of  ithe  distribution  of  (N',S). 

From  the  results  in  the  Theorem,  various  results  concerning  the 

A  t 

limiting  distribution  of  (A,y)  may  be  derived  A  couple  of  examples 
are  shown  below. 
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(a)  The  expected  values  of  (B/S,D/S)  quickly  "become  complicated. 
Given  N  =  B  +  D,  E(S-"^)  =  (X+y)/(N-l)  so  that 


(X+y)  (N-xq)' 
2 (N-l) 


(X+u) (N+xq) 
2  (N-l) 


and  it  follows  that  for  xQ  =  1,  E(B/S)  =  (x+y)/2  and  for  xQ  =  2 
it  may  be  seen  that 


6y3  +  6y2X  -  3.x2 T-i  +  X3 
12y2 


(b)  The  estimator  X  -  y  of  the  Malthusian  parameter  X  -  y 
is  asymptotically  equal  to  -x^/S,  whose  distribution  is  given  by  the 
density 


u 


X 

^2- (u+x) 


e 


$ 


4c. 

2  0 


(-2xq(uX)' 


/u) , 


u  <  0,  where 


I  is  a  modified  Bessel  function. 


6o  Equidistant  sampling 


Until  now  it.  has  been  assumed  -chat  the  complete  process 

{X^  j  0  <  u  <  t}  was  observed.  It  may  be  more  realistic  to  assume 

that  the  process  is  observed  at  the  equidistant  points  . . . ,kT=t . 

The  observations  X  (sometimes  called  "the  discrete  skeleton")  then 

nx 

form  a  Galton-Watson  process  as  is  well-known  (Harris,  1963,  p.  101), 

but  the  transition  probabilities  are  rather  messy,  and  direct  maximum 

likelihood  estimation  does  not  seem  feasible  (cf.  Darwin  (1956)) . 

In  this  Section,  however,  we  shall  show  that  interesting  results 

may  be  obtained  by  assuming  that  the  following  observations  are  available,, 

Interpret  the  discrete  skeleton  Galton-Watson  process  in  the  usual  way 

as  a  chain  of  generations  of  independently  reproducing  particles,  and 

assume  that  in  addition  to  X  .  itself,  the  number  C  of  particles 

nr  n 

among  the  X^  ^  ^  that  have  0  offspring  is  known. 

Proposition  6,1  Under  the  sampling  scheme  described  above,  the  likelihood 
function  is  proportional  to 

k-1 

ct.  ,o  *ut  ~  ^  xt~xn4Ct 

L(Xsy;  =  a  ui(l-a)(l -&)r  3  "  ,  A  >  0,y  >  0 


where 


k 

and  G  }  -  J  C 
n~l 


The  maximum  likelihood  estimators  of  a  and  3  are  given  by 
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k-l 

a  =  C  /  V  X 
*  0  nT 


and. 


6  -  (Xt  -  *0  +  Ct>/  l  \t- 


Proof.  Let  Z  =  X  _  and  let  C_  =  0.  Then  clearly  {(Z  ,C  ) I n=0  .1 .2 . . . „ } 
-  n  nT  0  n  n  1  5  5  * 

is  a  Markov  jchain  with  stationary-  transition  probabilities 


^  * 

(z,c)  =  P{Zn=z,  Cn=e  |  Zn_x*i ,  given  by  q^z.c)  =  q^Zjc), 

the  i'th  convolution,  q^(z,c)  =  ac(l-a),P_cB^ z-1^1-c^  (l-3)1_c,  and 
q^COjO)  =  1.  The  likelihood  function  is  then  derived  as 

P^Zk=Zk’  Ck=Gk  ^  Zk-l=Zk-l’  Ck-l=ck-l^  P^Z1=z1>C1=c;L  I  Z0=x0,C0=o). 
Theorem  6.1  Assuming  that  only  Xq,X  , . .  . , X^  =  X^  are  observed,  the 

maximum  likelihood  estimator  of  the  Malthusian  growth  parameter  X  -  y 

is  given  by 


)■ 

T  ^0+“--+X(k-l)T/ 


Proof,  ¥e  have  X  -  y  =  x  1  log{(l-a)/(l-3) },  and  the  result  is 

therefore  true  by  Proposition  6.1  if  X  and  C  were  observed. 

nT  n 


But  since  X  -  y  is  a  function  of  the  X  s  only,  the  result  holds 

nT 

in  the  more  narrow  sample. 

This  proof  is  patterned  after  Harris'  (19^8)  derivation  of  the 
maximum  likelihood  estimator  of  the  mean  in  an  unrestricted  offspring 
distribution  of  a  Galton-Watson  process  and  settles  a  question  left 
open  by  Darwin  (1956),  who  studied  as  an  estimator  of  e^_^)T 

without  proving  that  it  is  the  maximum  likelihood  estimator.  Darwin 
obtained  results  concerning  bias  and  asymptotic  variance  of  this  estimator 
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as  well  as  asymptotic  efficiency  relative  to  the-  maximum  likelihood 
estimator  exp{(B  -D, )t/S, }  obtained  from  continuous  observation. 

"C  T3 

E.  R.  Immel  (1951)  also  remarked  in  his  unpublished  UCLA  thesis 
that  estimation  directly  from  the  discrete  skeleton  is  unfeasible. 

Immel  then  considered  the  problem  of  estimating  the  parametric  function 
9  =  u/i  in  the  restricted  model  with  (logy.  -  log  a)  /  (y-A)  =  T  and 
showed  that  in  this  situation  the  maximum  likelihood  estimator  was 


k-1  k 

given  by  ¥  =  T  X  /  J  X  .  . 

g  il'i  j  ilx 


Immel  proved  consistency  and  asymptotic 


normality  of  this  estimator  for  large  :  Xq  within  the  restricted  model 
and  proposed  to  use  ¥  as  an  approximation  in  the  general  case. 

In  the  restricted  model  0  =  e'^  ~  ,  so  that  the  maximum 

likelihood  estimator  of  in  the  restricted  model  is  0  . 

From  Theorem.  6„1  we  see,  however,  that  "0  is  in  fact  the  maximum 
likelihood  estimator  of  tn  the  unrestricted  model,  which 

indicates  that  Immel *s  proposal  of  using  ¥  as  an  estimator  of  0 
in  general  should  not  be  followed. 

Comparison  with  permanent  observation.  The  infinitesimal 
discrete  skeleton 

By  applying  Dion’s  (1972)  results  for  the  Galton-Watson  process 
one  may  give  asymptotic  results  as  k  -*  °o  for  the  maximum  likelihood 
estimators  (Tsy")  of  (A,y)  in  this  sampling  situation  as  veil  as 
efficiency  results.  This  was  shown  in  detail  by  Keiding  ( 197^+ )  for 
the  pure  birth  process  and  we  shall  not  give  the  full  details  for  the 
birth-and-death  process . 

We  may,  however,  call  attention  to  the  fact  that  when  k  -»■  °°, 
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T  ■>  0,  kx  t ,  the  likelihood  function  of  the  equidistant  sampling 
situation  approaches  that  of  continuous  observation  given  in  Section 
2,  (When  observation  is ■ continuous ,  the  number  of  deaths  in  any 
interval  is  (almost  surely ).  given  by  the  knowledge  of  the  total 
population  number  at  each  instant  in  that  interval  and,  in  particular, 
-*■  D  .  )  It  may  be  seen  that  not  only  will  the  estimators 
(T,y)  (A,y)  under  this  limiting  process  but  that  also  the  asymptotic 

distributions  are  asymptotically  equal.  As  was  assumed  by  Keiding 
(197*0,  the  infinitesimal  discrete  skeleton  again  yields  an  alternative 
way  of  deriving  the  correct  results,  but  it  should  be  emphasized  that 
this  success  depends  on  the  introduction  of  the  observation  of  Cn 
above . 
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